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Gene expression is controlled primarily by interactions between transcription factor proteins (TFs) 
and the regulatory DNA sequence, a process that can be captured well by thermodynamic models 
of regulation. These models, however, neglect regulatory crosstalk: the possibility that non-cognate 
TFs could initiate transcription, with potentially disastrous effects for the cell. Here we estimate 
the importance of crosstalk, suggest that its avoidance strongly constrains equilibrium models of TF 
binding, and propose an alternative non-equilibrium scheme that implements kinetic proofreading to 
suppress erroneous initiation. This proposal is consistent with the observed covalent modifications 
of the transcriptional apparatus and would predict increased noise in gene expression as a tradeoff 
for improved specificity. Using information theory, we quantify this tradeoff to find when optimal 
proofreading architectures are favored over their equilibrium counterparts. 

PACS numbers: 


In prokaryotes, transcription factors recognize and 
bind specific DNA sequences L = 10 — 20 basepairs (bp) 
in length, usually located in promoter regions upstream 
of the regulated genes [T]. Regulation by a single TF, 
or a small number of TFs interacting cooperatively, is 
sufficient to quantitatively account for the experimental 
measurements of gene expression [2], as well as to ex¬ 
plain how any gene can be individually “addressed” and 
regulated only by its cognate TFs [3], without much dan¬ 
ger of regulatory crosstalk. In eukaryotes, however, TFs 
seem to be much less specific {L = 5—10 bp; but the total 
genome size is larger than in prokaryotes by ~ 10^) OH], 
binding promiscuously to many genomic locations 0 , in¬ 
cluding to their non-cognate binding sites [6]. What are 
the implications of this reduced specificity for the preci¬ 
sion of gene regulation? 

Thermodynamic models of regulation postulate that 
the rate of target gene expression is given by the equi¬ 
librium occupancy of various TFs on the regulatory se¬ 
quence and the success of this framework in 

prokaryotes [S] has prompted its application to eukary¬ 
otic, in particular, metazoan, enhancers [inHIl]- To il¬ 
lustrate the crosstalk problem in this setting, consider 
the ratio a of the dissociation constants to a nonspe¬ 
cific and a specific site for an eukaryotic TF; typically, 
cr ^ 10^ (corresponding to a difference in binding energy 
of ~ 7 ksT) [51 [T5] . Because there are v ~ 10^ — 10^ 
of different TF species in a cell, TFs nonspecific to a 
given site will greatly outnumber the specific ones. For 
an isolated binding site, this would imply roughly equal 
occupancy by cognate and noncognate TFs, suggesting 
that crosstalk could be acute. For multiple sites, coop¬ 
erative binding is known for its role in facilitating sharp 
and strong gene activation even with cognate TFs of in¬ 
termediate specificity—but could the same mechanism 
also alleviate crosstalk? First, note that there exist well- 
studied TFs which do not bind cooperatively (e.g. [17]). 
Second, while many proposed regulation schemes give 


rise to cooperativity (e.g., nucleosome-mediated cooper- 
ativity [18], or synergistic activation m) they will not 
suppress crosstalk; for the latter, cooperativity needs to 
be strong and specific, stabilizing only the binding of cog¬ 
nate TFs. Third, even when cooperative interactions are 
specific, crosstalk can pose a serious constraint. Regulat¬ 
ing a gene implies varying the cognate TF concentration 
throughout its dynamic range, and when this concen¬ 
tration is low and the target gene should be uninduced, 
cooperativity cannot prevent the erroneous induction by 
noncognate TFs. For that, the cell could either keep 
the genes inactive by binding of specific repressors, or 
by making the whole gene unavailable for transcription. 
The first strategy seems widely used in bacteria but less 
so in eukaryotes; the second strategy (“gene silencing”) 
is widespread in eukaryotes, but only happens at a slow 
timescale and involves a complex series of nonequilibrium 
steps. 

Here we propose a plausible and fast molecular mech¬ 
anism which alleviates the effects of crosstalk; a detailed 
account of when crosstalk poses a severe constraint for 
gene regulation will be presented elsewhere. The pro¬ 
posed mechanism is consistent with the known tight con¬ 
trol over which genes are expressed in different condi¬ 
tions or tissues (e.g., during development m) on the 
one hand, and on the other, explains the high levels of 
measured noise in transcription initiation of active genes 

[min]. 

The simplest proofreading architecture for transcrip¬ 
tional gene activation that can cope with erroneous bind¬ 
ing is presented in Fig[T]\,B, motivated by a scheme first 
proposed by Hopfield [5D]. Specificity is only conveyed 
by differential rates of TF unbinding (“off-rates” kf, klff, 
with a = k'ff/k^). There are noncognate TF species 
whose typical concentration we take to be Cnc = 
and C is the maximal concentration for the cognate TFs 
Cc, Cc G [0, C]. The ratio K = v/a determines the sever¬ 
ity of crosstalk, which is weak for A ^ 1 and strong for 
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FIG. 1: A) A schematic of cognate (green circles) and v kinds 
of noncognate (various red shapes) TFs binding to a gene reg¬ 
ulatory element on the DNA (gray box), to control the mRNA 
expression level. B) Transition state diagram for the proof¬ 
reading gene regulation. The regulatory element can cycle 
between an empty state (O), state occupied by either cognate 
(ic) or noncognate (inc) TF; to initiate gene expression, a 
further non-equilibrium transition into “2” states (with rate 
1/g) is required, driven by, e.g., hydrolysis of ATP. mRNA 
is expressed at rate r and degraded with rate d, the slow¬ 
est process that sets our unit for time. In this figure we use 
r/d = 100,fcll7d = 2500, cr = = 50, A = v/a = 0.1; di¬ 

mensionless concentration is c = k+Cc/d. C,D) Steady-state 
mRNA distributions for low and high concentrations of the 
cognate TF, c. As qd —> 0 (C), the proofreading model re¬ 
duces to the two-state model of gene expression |22) : here, 
noncognate TFs initiate transcription at a high rate even 
when c is low, causing overlapping output distributions (blue; 
top) and small dynamic range (black line = (m(c)), blue shade 
= o'm(c); bottom). Proofreading (D) suppresses erroneous 
initiation, leading to separable output distributions (orange; 
top) and higher dynamic range (bottom). 


A ^ 1. The response of the promoter to the dimen¬ 
sionless input concentration c (= k+Cc/d, see Fig[^) of 
cognate TFs is captured by the steady state distribution 
of mRNA, P(m|c); the spread of this distribution is due 
to the stochasticity in gene expression, which includes 
random switching between promoter states and the birth- 
death process of mRNA expression [21]. If the reaction 
rates are known, P(m|c) is computable from the chemical 
Master equation corresponding to the transition diagram 
in Fig EP; using finite-state truncation, this becomes a 
linear problem that is numerically tractable. 

Figures and D each compare the steady state dis¬ 
tributions of mRNA at low and high concentration of 
cognate TF, c. The behavior crucially depends on the 




FIG. 2: A) Maximal information transmission (left axis, 
black) and the error fraction (right axis, gray) as a func¬ 
tion of the inverse irreversible reaction rate, qd. Increasing 
qd suppresses the error fraction, but only at the cost of in¬ 
creasing the gene expression noise, leading to a tradeoff and 
an information-maximizing value of q*d (orange). This max¬ 
imum is reached robustly with input distributions that are 
close to optimal (inset). B) Noise in gene expression, am. /{m), 
computed from the moments of P{m\c), as a function of the 
dimensionless input concentration c, for the optimal proof¬ 
reading (orange) and the two-state (blue) architectures. Dot¬ 
ted lines show the Poisson limit, cr^ = (^)i for comparison. 
In both cases, the average number of mRNA expressed if fixed 
to m = 100. 


out-of-equilibrium rate qd. When qd —>■ 0, the scheme 
of Fig becomes a normal two-state promoter as the 
states Ic and 2c (likewise Inc and 2nc) fuse into a sin¬ 
gle state. In this limit, the effect of crosstalk is highly 
detrimental already at A = 0.1 used in this example: at 
low c, the promoter repeatedly cycles through erroneous 
initiation and the gene is highly expressed both at low 
c as well as at high c (where most of the expression is 
indeed due to correct initiation); as a result, the distri¬ 
butions P{m\c) show substantial overlap in the two input 
conditions shown in Fig[^. In contrast, for a non-trivial 
choice of q {k‘L <C 1/q — fc7), the model can exhibit 
proofreading. Even at low cognate concentration c, the 
slow irreversible transition ensures that noncognate TFs 
unbind from the promoter and that erroneous initiation 
is consequently rare, which is manifested as a sharp peak 
of P(m|ciow) at small m in Fig[^. The proofreading ar¬ 
chitecture generates a larger output dynamic range and 
consequently makes the responses distinguishable. 

What are the costs to the cell of the proposed proof¬ 
reading mechanism? First, the mechanism requires an 
energy source, e.g., ATP, to break detailed balance. 
Whether such a metabolic cost is a burden to the cell 
is unclear: a few molecules of ATP paid per initiation 
should be negligible compared to the processive cost of 
transcription and translation. Second, however, is an in¬ 
direct cost in terms of gene expression noise. While proof¬ 
reading decreases erroneous induction, it takes longer to 
traverse the state transition diagram from empty state 0 
to expressing state 2, and since the promoter can perform 
aborted erroneous initiation cycles, the fluctuations in 
the time-to-induction will also increase [23] . This will re¬ 
sult in additional variance in the mRNA copy number at 
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steady state compared to the two-state {qd —>■ 0) scheme. 
While the speed/specificity tradeoff in protein synthesis 
has been examined before using deterministic chemical 
kinetics [5^, this stochastic formulation of proofreading 
has, to our knowledge, remained unexplored. Proofread¬ 
ing in gene regulation is thus expected to increase the 
output dynamic range, which is favorable for signaling, 
but also to increase the noise, which is detrimental. 

How can we formalize the tradeoff between noise and 
dynamic range for gene regulatory schemes and find when 
proofreading is beneficial? In existing analyses of proof¬ 
reading the erroneous incorporation of the substrate leads 
to an error product that is different from the correct 
one uniiis]; in contrast, here the gene always expresses 
the same mRNA. What is important for signal transduc¬ 
tion, however, is how well this expression correlates with 
the input signal, c. To quantify the regulatory power 
of the proofreading architecture, we computed the mu¬ 
tual information, I (c; m) [55] , between the signal c and 
the mRNA expression level m, following previous appli¬ 
cations of information theory to gene regulation HUHTj. 
The information depends not only on P{m\c), which we 
compute from the Master equation, but also on the a pri¬ 
ori unknown distribution of input concentrations, P{c)\ 
we therefore determined the input distribution P* (c) 
that maximizes information transmission, subject to a 
constraint on the average number of expressed mRNA, 
m = /dcP(c)E ^mP{m\c). This constraint on average 
number of mRNA was imposed to compare different regu¬ 
latory architectures; otherwise, higher average expression 
could yield higher information transmission for trivial 
reasons. Such constrained information (capacity) max¬ 
imization is a well-known problem in information theory 
that can be solved using the Blahut-Arimoto algorithm 

[53]. 

Figure [2]A shows how the information transmission 
/(m; c) through the promoter depends on the (inverse) 
reaction rate qd. We start by looking at the classic mea¬ 
sure of proofreading performance, the “error fraction,” 
i.e., the ratio of the mRNA expressed from state 2nc due 
to noncognate TFs, vs mRNA expressed from state 2c 
due to cognate TFs. As qd is increased, the error fraction 
drops, with no clear optimum. In contrast, there exists 
an optimal q*d at which the information is maximized— 
this is the point where proofreading is most effective, op¬ 
timally trading off erroneous induction (here, suppressed 
by a factor of ~ 30 relative to no proofreading), noise in 
gene expression, and dynamic range at the output. In 
Fig[^ we plot the noise in gene expression, as a func¬ 
tion of the input concentration c for the optimal proof¬ 
reading architecture and the non-proofreading limit. In 
both cases the noise has super-Poisson components due 
to the switching between promoter states, but this excess 
is substantially higher in the proofreading architecture, 
as expected. 

While attractive, these results still depend on the par- 



FIG. 3: A) Information advantage (in bits, color scale) of 
optimal proofreading over optimal two-state architectures, as 
a function of crosstalk severity A and dynamic range of in¬ 
put TF concentration, Cmax. Typical values for prokaryotes, 
yeast, and metazoans are marked in white. Lower inset: opti¬ 
mal rates, (black line = average over Cmax, gray shade 

= std), indicate a switch to the proofreading strategy. B, C, 
D) Cuts through the information plane in (A) along white 
dashed lines showing the collapse of two-state performance as 
log]^Q(A) —>■ 0 and a clear proofreading advantage for meta¬ 
zoan regulation. 


ticular rates chosen for the model in Fig HP- Surpris¬ 
ingly, if we choose to compare the optimal proofread¬ 
ing scenario with the optimal non-proofreading one, the 
problem simplifies further. Given that the input TF 
concentration c varies over some limited dynamic range, 
c G [0, Cmax = fc-i-C/d], there should exist also an opti¬ 
mal setting for kf-. set too high, the cognate TFs will be 
extremely unlikely to occupy the promoter for any signifi¬ 
cant fraction of the time and induce the gene; set too low, 
the switching contribution to noise in gene expression will 
blow up. With kf and q in the “correct initiation” path¬ 
way of Fig[^ set by optimization, the remaining rates in 
the “erroneous initiation” pathway are fixed by the choice 
of crosstalk severity A. The remaining parameters reg¬ 
ulating mRNA expression—the average mRNA count m 
and the rate r—do not change the results qualitatively. 
The mRNA expression rate r simply sets the maximal 
number of mRNA molecules at full expression in steady 
state {r/d)\ this influences the Poisson noise at the out¬ 
put, but does so equally for any regulatory architecture, 
proofreading or not. As long as r is large enough so that 
the average mRNA constraint rfi is achievable, the precise 
choice of these values is not crucial (we use r/d = 200, 
m = 100, plausible for eukaryotic expression). In sum, 
we can compare how well the optimal proofreading ar¬ 
chitecture does compared to optimal non-proofreading 
architecture in terms of information transmission, as a 
function of two key parameters: the crosstalk severity, 
A, and the input dynamic range, Cmax- 
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Figure]^ shows the advantage, in bits, of the optimal 
proofreading architecture relative to the optimal non¬ 
proofreading one. This “information plane,” /q*(TO;c) — 
Iq=o{m; c), is plotted as a function of A and Cmax- In the 
limit A —>■ 0, the difference in performance goes to zero: 
there, optimization drives q*k^'* ^ 1, but proofread¬ 
ing offers vanishing advantage over the optimal two-state 
promoter architecture when noncognate binding is neg¬ 
ligible. As A increases, proofreading becomes beneficial 
over the two-state architecture, and more so for higher 
values of Cmax- Higher input concentrations c S [0, Cmax] 
permit faster on-rates, resulting in faster optimal off rates 
/cl’* and faster optimal 1/q*. Generally, faster switch¬ 
ing of promoter states in Fig P means that promoter 
switching noise will be lower and thus information higher 
(at fixed mean mRNA expression m); in particular, opti¬ 
mization tends to minimize promoter switching noise by 
selecting the fastest 1/q that still admits error rejection, 
i.e., q*k//'’* ~ 1. At A = ^/(T ~ 1, the signaling capac¬ 
ity of the non-proofreading architecture collapses com¬ 
pletely, with Iq=Q{c;m) « 0 [32]. At this point optimal 
proofreading architectures are affected, but still gener¬ 
ally maintain at least half of the capacity seen at A = 0; 
proofreading extends the performance of the gene regula¬ 
tion well into the A > 0 region, before finally succumbing 
to crosstalk. 

Where do different organisms lie in the information 
plane? Prokaryotes have on the order of ~ 100 types 
of transcription factors, whose binding site motifs typi¬ 
cally contain around 23 bits of sequence information [3|, 
corresponding to the binding energy difference of 16 ksT 
between cognate and noncognate sites |2H|, and thus a 
specificity of roughly a ~ 10^. This corresponds to a 
small value of crosstalk severity, A ~ 10“®. For yeast, 
the typical sequence information is 14 bits (lOfcsT) [3|, 
which gives A ^ 0.01 (for v ~ 200 [29]). For multi¬ 
cellular eukaryotes, the typical sequence information is 
12 bits (SksT), and the number of TF species varies 
between i/ « 10^ (C. elegans) to « 2 • 10^ (human) 
m, putting A between 0.1 and 1. We can also estimate 
the dimensionless parameter Cmax = k+C/d. Assuming 
diffusion-limited binding of TFs to their binding sites, 
k+C/d « SDaN/R^d, where D ^ l/rm^/s is the typi¬ 
cal TF diffusion constant [3D], a^3 nm is the binding 
site size, R = 3 fim (1 /xm) is the radius of an eukaryotic 
nucleus (prokaryotic cell), and N is the typical copy num¬ 
ber of TFs per nucleus {N ^ 10 for prokaryotes, 10^ for 
yeast, 10^ — 10® for eukaryotes). Typical mRNA lifetimes 
are 5 — 10 min in prokaryotes, 20 — 30 min in yeast, and 
> 1 hour in metazoans. This yields Cmax of order 10 for 
prokaryotes, 10^ for yeast cells, and > 10^ for multicellu¬ 
lar eukaryote cells. While these are very rough estimates, 
different kinds of cells clearly differ substantially in their 
location on the information plane of Fig[^. 

Taken together, these values suggest that crosstalk is 
acute for metazoans and that proofreading in gene regu¬ 


lation could provide a vast improvement over equilibrium 
regulation schemes, as in Fig [^. In contrast, our pro¬ 
posal offers no advantage for prokaryotes, and remains 
agnostic about yeast (Figs|^, D). While much remains 
unknown about the molecular machinery of eukaryotic 
gene regulation, it has been experimentally shown that 
transcriptional initiation (not just elongation) involves a 
series of out-of-equilibrium steps. Amongst those, per¬ 
haps the most intriguing are the covalent modihcations 
on the eukaryotic RNA polymerase II CTD tail [3T]. The 
tail contains tandem repeats of short peptides (from 26 
repeats in yeast to 52 in mammals), which need to get 
phosphorylated in order to initiate transcription and sub¬ 
sequently cleared after completed transcription in order 
to reuse the polymerase; genetic interference with this 
tail seems to be lethal. One can contemplate a scenario 
where a sequence of such phosphorylation steps corre¬ 
sponds to the out-of-equilibrium reaction q of our simple 
proofreading scheme, “ticking away” time until the poly¬ 
merase commits to initiation, with every tick giving the 
machinery another opportunity to check if cognate TFs 
are still bound and, if not, abort transcription. The exis¬ 
tence of any such (or similar) proofreading scheme would 
be interesting, but is currently purely hypothetical. 

Why would eukaryotes employ a method of gene regu¬ 
lation so qualitatively different from prokaryotes, instead 
of simply using longer, specific binding sites that would 
drive crosstalk severity A towards zero? While beyond 
the scope of this work, one possible hypothesis is that 
such longer sites are not easily evolvable and, addition¬ 
ally, that the complexity of regulation calls for combi¬ 
natorial control of single genes by many TFs of differ¬ 
ent species, each of which could have weak specificity. 
Such cooperative or combinatorial control could indeed 
address a specific target gene uniquely, as proposed (e.g., 
El US]); what has largely been neglected in previous dis¬ 
cussions is that it would be difficult to prevent the tar¬ 
get gene from being erroneously induced by crosstalk. 
Here we advanced a possible hypothetical mechanism, 
proofreading-based transcriptional regulation, to miti¬ 
gate this problem. It is interesting to note that, unlike 
most biophysical problems where we clearly appreciate 
their out-of-equilibrium nature, transcriptional regula¬ 
tion has remained a textbook example of a non-trivial 
equilibrium molecular recognition process, likely due to 
the success of the equilibrium assumption in prokaryotes. 
Perhaps constraints imposed by crosstalk will motivate 
us to reexamine this assumption in eukaryotic regulation 
more closely. 

We thank TR Sokolowski and T Friedlander for helpful 
comments on the manuscript, and E van Nimwegen for 
suggesting that histone modihcation / remodeling might 
also constitute a candidate proofreading mechanism. 
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